Accession Number |
TCMCG021C29796 |
gbkey |
CDS |
Protein Id |
XP_019702717.1 |
Location |
complement(join(10881..12489,15207..15661,15792..16547,19040..19126,19262..19570,20317..20491,21949..22070,65379..65425,75002..75251)) |
Gene |
LOC105034972 |
GeneID |
105034972 |
Organism |
Elaeis guineensis |
|
|
Length |
1269 aa |
Molecule type |
protein |
Topology |
linear |
Data_file_division |
PLN |
dblink |
BioProject:PRJNA268357 |
db_source |
XM_019847158.2 |
|
Definition |
nuclear pore complex protein NUP1 isoform X1 [Elaeis guineensis] |
CDS: ATGGCGACGGCGGGCTACGGGGGAGGGATCGGAGGGAAGTTCCGGCGGCGGCCGCATAGGAGGGCCGCGACGACGCCGTATGACCGCCCGCCGGCGGTGGCTCGCGGGCTTAGGGGCCGGACGGCGGAGGCGGGACGCAACGGGTGGCTCTCGAAGCTCGTCGATCCCGCCTCGCGGTTCATCGCCAGCAGCACCTCCAGGATCTTCTCCTCTGTGTTCGGCAAGCGTCTTGCGCTGCCGGAAGCTCCAGAAGGAAACCATCAGTCGAGTCAAGAAGCTCCTGAAGTAGCTTGCACTCTGCTGTATCCATCATCCAAATTGCTAGAAAATGAAAGAAATGGGGCTGAGCAAACGAATAATCCTGATACCAGTGGTATTTCAGAACTTGAAGAACTACTGAAGCAGAAAACATTCACGAGGGTTGAGTTTGATTATTTGACACAATTACTGCGCTCAAGAACTTTTGAGCCAGACAGGTCAAAACCTACAACCAATAAGGAAGAAAAGGAAGAGACAGTTGTTTCAGCACAAGATAATGGAGTAGGATGTTCCAAATCATTTCAGGATTTCTCAACTCCTTCCAAAAGTTTAATGATTCCTGAGGTTGAAGCTGCTTCACCAGCAGAACTTGCCAAAGCATATATGGGGTCTAGATCTTCAAAAGTGCCCTTTTCTGCTCTAAGTTTGCGGAGCCAAGTTTTTCGTGGAGATAAGACAATGCCAAGAAATGCACCGTGTATATCAAAACCATTTGACCCATCCGTCGTACCAAAATCTGTTGTTCGATTTTCTGAAACCCCTGATCTCCCTGAAAATGGTTACATGGCACCAAGAGGCAGATCAGCCATATACAGAATGTCCCGTTCTCCATATTTTAAGGCCCATCCTACAACAAACGTGAAGGGTGCAGGACCTTCCAACAATGTCTCTCATGGCCCGTCATCATCTCACCAGACTCTAGCAAGCACCATGCATTCTGGTGGCAGGCAGGTCTTGAACCAAGGTAGTTCAGCTTTAGATGGTGATTTTGGATCTGTTGGTCCTATACGTAGAATTCGCCAGAAGTCCAATATGATATCTCCTACAAAAGATATACGCTTGAAATTTCCTGGAAATCTTCCTCCTAGTCCCTCGACCCCACTTGACAAAGGTTTTATCCAAGGTTCTGCGTCAATGCAAGAGACTGTTGGGCTGGATGACCAGAAGCATGATAGCATAGACTTACAGAGTTCAGAAAAGGGAAATAATAGAAAATCTTATGAAAATATTGTTTCCAGTCCACTGCAGTCAAGAGAGACTGCCAAGAAAATATTGCAACAGCTTGATAAATTAGTGCCTTCACCAAAAGAAAGGTCATCTGAACCAAACACTTTTTCTAGAGATGAATCACCTTCAAAGTTAATGCATATTTCTTTACGTGGACAGGTTTTTGAAAACATGAAGGATATAAATTCATCTAAGCCTCTAGATAAGGAAGGCAATGATAATTTGGATGCTGTCGATGATTCCCTTCTGCTAGATATCAGAAATACACCTCCTCAAAATCCAGTCAAGGTAGAAGAAAATGGCCCAATAAAATCTTCTGTTTCAGGAGTTAAATCGGCATCTGAAAGCAATAGCGCGGATGATGCTCTTGTACGTGTCGCAGATTTCATGCCTGGTAACAGTTCTGCACATGTTGGGATCTCAGATTCTGCTGCTTTTCCCTCACAAAAGAATCCAGCTTTTAAAATGATTGCACCGGAGGATTCACTGGATCTGGATGATGATAATAACAACAAGGACACTTCTGGTCCAGTAAGCACTATAGTGGATAACGTTGAGCTGAAGATATTAAAGCATGAAGATGTTACTTCTGAGCCAGTAACAGCAAAGAGATTTCTGATAGCTTCATCCAAGTACACACCTTTTCCGAGTTCCATATCTGCTGGAGAAGCTGATATGAAGGGTTCTGTTGGACCTGTGATTTCTGAGAAAAGCACTGGCTTTACATTT
CCTGTCGCACCTGCTCCCAGCATTCATTCTCAGCCACCTCCAAAACCTGCCATGCCAAGCCCTCTGGTTGACAGACCAGCTTCTCAGAAGGAACAAACAGCTGCTCCTTTTAGTTTTGGCTCCAAGGATGAACCGACCTTTACATTTTCATCAACTGTGAGTACGACAGGTTTCAGTGAGACTGATGGCCTGAAAATTGGTGTGAGCAATGACAGTTCATCAATTGTAGATAACAACTCAAAGTTGAATCTATGTGGTGAAATGCAGCAGGCTGGAGATTTGATCAAATCAGTTGGAACAGAAGTTTCCTCAGTCATATCAACATCAACAACACCTCGTGTCTTTGCTTTTGGTGCTTCGACTGCTCCAAGCTTAAGCAATGGCTCACTGCGCTCATCCTCTAACTTCTCATTTTCCACTTCTCCAGCTGTGACATTTTCTGCTGGTACTGCAAGCTCGATTTTCTCTACCAGCCCTTCCAGTGCAACTGGTTGCAATGGTTTATCCGTATCACCTGCAGCACCTATATTTTCCATGGTTCCCGTACTTCAGTTTGGGTCCAGTACCTCAAAAGGGTTTCTGATATCAGTTTCTTCCCAGTATAAATCAGATAACATGGATATGGGGGCAAAGCCTACCAAGGCATCACCATTCAGCTTAAATAGTTCTGCTCAGGGCACATTCTCATTTTCAAGCACAGGCAGCAGCAATTCGTCTGCTCTGACAGTACCATCTGCATTTTCTAACACCGGCAGTGACTTATCTGTTGCGGCAACATTATGTGCAAGTTCAAGCATGGGCAGCAGTTCCACTGCTCCTGCCCCAAGTATGAGTGCCAGCCAATCTGTCCTTGGACCATCATCAGCATTTTCAGATACAACCAGCATTTCTGGGTTCAGTTCTTCAGGGCAGTCCGGTAGTTTGAGTCCATCTGTTGCAGCTAGCAATTCTCAGAACTTTGCTGCTAGTTTTGGTGCTACAACTGCATCTTTTGGCATACAGTCAACCCAAACCGGAAGTTGGGTCTCACACATTTCTCAAAGCTCTGCGAGTCCATTTGGTCCTTCTCTGTCAGCTCCGACATTTGGACTCAGTGCTACTTCCTCATCTGGTTTTGGCAGTTCACCATTTGGGCATGCATCTGGTACTAAATCTTTTAGTTCAAGTTCTGGATTCTCTGTTTCAGCTGGTGCCAACTCTTCCAGCCCTGGGACCAGCTCTTCTGCAGCTACCACTAGTTTGTTCAGTTCAAGTTCCCTACCATCCACATCATCTGTCTTCGGTACCGGTTTTGGATCTAGTGTGTCCCCATCTACTGGATTTTCATTTGGACTGTCTACATCTGCTTTAGGGAGTTCATCTACATTTGGTTCATCATCAGGCTCAGCATTCTCATTCACTTCAGCTGGTTCTACTCCAACGCCACTTTTTTCTGCGCAACCTGTATTTGGAATGTCCACTGCAGCTGCTGGTTTTAGCTCAGGATCTACTGGAACTGATCAGATGAATGTCGAGGACAGCATGGCTGATGACACTAACCAAGCAGAGGTTTCTATGGTTTCGGCATTTGCTCAACCAAGCAGTTCACTTGCGGCACCAGTCTTTGGTGCTCCAGCAAATTTGTCAGGTGGATCACCCATTTTCCAATTTGGTAGCCATCTGAATTCTTCTATCCCTCAAAATCCATCCCCATTTCAGGCAGCTGGTAATCTAGAGCTTCCTCCGGGAGGAAGCTTTTCTTTGGGTAGTGGTGGTGGGGACAAGTCTGGCCGAAAATTTGTGAAAGTAAGACGAGACAAGCAACGAAAGAAATAA |
Protein: MATAGYGGGIGGKFRRRPHRRAATTPYDRPPAVARGLRGRTAEAGRNGWLSKLVDPASRFIASSTSRIFSSVFGKRLALPEAPEGNHQSSQEAPEVACTLLYPSSKLLENERNGAEQTNNPDTSGISELEELLKQKTFTRVEFDYLTQLLRSRTFEPDRSKPTTNKEEKEETVVSAQDNGVGCSKSFQDFSTPSKSLMIPEVEAASPAELAKAYMGSRSSKVPFSALSLRSQVFRGDKTMPRNAPCISKPFDPSVVPKSVVRFSETPDLPENGYMAPRGRSAIYRMSRSPYFKAHPTTNVKGAGPSNNVSHGPSSSHQTLASTMHSGGRQVLNQGSSALDGDFGSVGPIRRIRQKSNMISPTKDIRLKFPGNLPPSPSTPLDKGFIQGSASMQETVGLDDQKHDSIDLQSSEKGNNRKSYENIVSSPLQSRETAKKILQQLDKLVPSPKERSSEPNTFSRDESPSKLMHISLRGQVFENMKDINSSKPLDKEGNDNLDAVDDSLLLDIRNTPPQNPVKVEENGPIKSSVSGVKSASESNSADDALVRVADFMPGNSSAHVGISDSAAFPSQKNPAFKMIAPEDSLDLDDDNNNKDTSGPVSTIVDNVELKILKHEDVTSEPVTAKRFLIASSKYTPFPSSISAGEADMKGSVGPVISEKSTGFTFPVAPAPSIHSQPPPKPAMPSPLVDRPASQKEQTAAPFSFGSKDEPTFTFSSTVSTTGFSETDGLKIGVSNDSSSIVDNNSKLNLCGEMQQAGDLIKSVGTEVSSVISTSTTPRVFAFGASTAPSLSNGSLRSSSNFSFSTSPAVTFSAGTASSIFSTSPSSATGCNGLSVSPAAPIFSMVPVLQFGSSTSKGFLISVSSQYKSDNMDMGAKPTKASPFSLNSSAQGTFSFSSTGSSNSSALTVPSAFSNTGSDLSVAATLCASSSMGSSSTAPAPSMSASQSVLGPSSAFSDTTSISGFSSSGQSGSLSPSVAASNSQNFAASFGATTASFGIQSTQTGSWVSHISQSSASPFGPSLSAPTFGLSATSSSGFGSSPFGHASGTKSFSSSSGFSVSAGANSSSPGTSSSAATTSLFSSSSLPSTSSVFGTGFGSSVSPSTGFSFGLSTSALGSSSTFGSSSGSAFSFTSAGSTPTPLFSAQPVFGMSTAAAGFSSGSTGTDQMNVEDSMADDTNQAEVSMVSAFAQPSSSLAAPVFGAPANLSGGSPIFQFGSHLNSSIPQNPSPFQAAGNLELPPGGSFSLGSGGGDKSGRKFVKVRRDKQRKK |
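
The Location field above uses the GenBank-style `complement(join(...))` notation: nine exon intervals on the reverse strand that are spliced together to form the CDS. As a rough consistency check, the sketch below sums the interval lengths and compares the total with the stated protein length of 1269 residues plus one stop codon. The helper names `parse_location` and `spliced_length` are hypothetical, and the parser assumes a simple location with plain `start..end` pairs (no partial markers such as `<` or `>`, and no references to other accessions).

```python
import re

# Location string copied from the record above.
LOCATION = (
    "complement(join(10881..12489,15207..15661,15792..16547,"
    "19040..19126,19262..19570,20317..20491,21949..22070,"
    "65379..65425,75002..75251))"
)

PROTEIN_LENGTH = 1269  # residues, from the Length field of the record


def parse_location(location):
    """Return (is_complement, [(start, end), ...]) for a simple location.

    Assumption: only plain 'start..end' pairs appear in the string.
    """
    is_complement = location.startswith("complement(")
    segments = [(int(a), int(b)) for a, b in re.findall(r"(\d+)\.\.(\d+)", location)]
    return is_complement, segments


def spliced_length(segments):
    """Total nucleotides covered; coordinates are 1-based and inclusive."""
    return sum(end - start + 1 for start, end in segments)


is_complement, segments = parse_location(LOCATION)
total_nt = spliced_length(segments)
expected_nt = 3 * (PROTEIN_LENGTH + 1)  # coding codons plus one stop codon

print("reverse strand:", is_complement)
print("segments:", len(segments))
print("spliced CDS length (nt):", total_nt)
print("matches protein length:", total_nt == expected_nt)
```

For this record the nine intervals add up to 3810 nt, which is 1270 codons: 1269 residues plus the terminal stop codon, in agreement with the Length field.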